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ABSTRACT 

The next frontier of Lya forest studies is the reconstruction of 3D correlations from 
a dense sample of background sources. The measurement of 3D correlations has the 
potential to improve constraints on fundamental cosmological parameters, ionizing 
background models, and the reionization history. This study addresses the sensitiv- 
ity of spectroscopic surveys to 3D correlations in the Lya forest. We show that the 
sensitivity of a survey to this signal can be quantified by just a single number, a noise- 
weighted number density of sources on the sky. We investigate how the sensitivity 
of a spectroscopic quasar (or galaxy) survey scales as a function of its depth, area, 
and redshift. We propose a simple method for weighting sightlines with varying S/N 
levels to estimate the correlation function, and we show that this estimator generally 
performs nearly as well as the minimum variance quadratic estimator. In addition, we 
show that the sensitivity of a quasar survey to the flux correlation function is generally 
maximized if it observes each field just long enough to achieve S/N fa 2 in a 1 A pixel 
on an quasar while acquiring spectra for all quasars with L > L^: Little is gained 
by integrating longer on the same targets or by including fainter quasars. We quantify 
how these considerations relate to constraints on the angular diameter distance, the 
curvature of space-time, and the reionization history. 

Key vifords: cosmology: theory - cosmology: large-scale structure - quasars: absorp- 
tion lines - intergalactic medium 



1 INTRODUCTION 

The Lya forest is an established tool for studying struc- 
ture formation, the intergalactic medium (IGM), and cos- 
mological parameters. It has been used to place constraints 
on the linear-theory density correlations at ^ 1 comoving 
Mpc separations, smaller separatio ns tha n other large -scale 



structure probes (^Weinber g et al.l 



1997; 
I2OO5, 



Croft et al. 199S 
,Viel et al...200^ 



iMcDonald et al.l[2000. : Seliak et al. 
In addition, it is our best tool for studying the ther- 
mal history of the intergalactic gas (ISchave et al ] I2OOOI: 



iMcDonald et al.ll200ll : iLidz et al.|[201ol : iBecker et al.H201ll ). 

and it has been used to place a lower bound on the red - 
shift of reionization (|Becker et al.l I2OOII : iFan et al.1 12OO6I ). 
However, past Lya forest analyses have only utilized cor- 
relati ons in the transmissi on within a single sightline 
(e.g., IMcDonald et al.l |2005|) or between a small number 
of sightlines (e.g.. IWilhger et al.1 I2OO0I : iLiske et al.1 I2OO0I : 



iHennawi fc Prochaskal I2OO7I V This approach was justified 
because of the low sky density of quasars in previous wide- 
field surveys. However, the next generation of surveys will 
achieve densities of « 10 — 100 quasars per deg^, allowing 
S/N > 1 on tens of comoving Mpc 3D modes. The ongo- 
ing Baryon Oscillation Spectroscopic Survey (BOSS) on the 

2.5 m Sloan Telescope aims to measure 3D correlations from 

1.6 X 10^ quasars over 8000 deg'^. A similar survey but on a 
4 m telescope, BigBOSS, is anticipated to begin in 20160 In 
general, piggybacking a 3D Lya survey on an optical spec- 
troscopic survey comes with little cost owing to the low sky 
density of z > 2 quasars. 

The advantages of 3D Lya forest correlations have 
been enumerated in several previous theoretical studies. The 
traditional approach utilizing purely line-of-sight measure- 
ments will always be more sensitive to parameters that im- 
pact correlations on less than several Mpc scales. However, 
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unlike with line-of-sight measurements, with 3D correlation 
measurements it is possible to detect the baryon acous- 
tic oscillation f eature at high precision and to use it as a 



tic osculation t eature at high precision and to use it as a 
standard ruler llWhitell200d: [McDonald fc EisensteinI [200?! : 



ISlosar et al]|2009l : IWhite et aLlbOlof T Such a measurement 
at = 2 — 3 with a BOSS-like spectroscopic quasar survey 
has the potential to rule out early dark energy models and 
to place 10~^-level constraints on Q,k, the z = spatial 
curvature density (jMcDonald fc EisensteinI 120071 '). Three- 
dimensional measurements are also more sensitive to large- 
scale inten sity and temperature fluctuations than ar e line-of- 
sight ones (jWhite et al.ll2010l : [McQuinn et al.ll2010l ). In fact, 
the resulting temperature fluctuations from physically moti- 
vated models for He ll reionization can change the Lya forest 
correlation function by 0(1) at 100 Mpc separations. This 
difference would be easily detected in a measurement utiliz- 
ing correlations between sightlines, but woul d have gone un- 
detec ted in previous line-of-sight analyses (jMcQuinn et al.l 
l2010h . In addition, a future Lya forest survey that is sensi- 
tive to z « 4 could detect th e relic temperature flu ctuations 
from hydrogen reionization ([McQuinn et al.ll2010l ). 

This paper studies the ability of spectroscopic quasar 
surveys to constrain correlations in the 3D Lya forest. Sec- 
tion [2] presents formulae for the sensitivity of spectroscopic 
surveys to 3D correlations as well as for how to weight 
sightlines based on their signal-to-noise properties. We use 
the resulting expressions to understand the sensitivity of 
both purely hypothetical and proposed quasar surveys as 
a function of depth, volume, and redshift. Section [3] dis- 
cusses how the suggested weights relate to the minimum 
variance quadratic estimator. Section [4] discusses additional 
survey considerations: (1) whether taking the spectra of 
high-redshift galaxies in addition to quasars can enhance 
the sensitivity, (2) the advantages of cross correlating a 3D 
Lya survey with another survey of large-scale structure, (3) 
how to optimize a survey's strategy to minimize the uncer- 
tainty in its estimate of the correlation function, and (4) 
how the previous considerations translate to constraints on 
cosmological parameters and interesting astrophysical pro- 
cesses. Finally, Section [S] discusses the severity of continuum 
subtraction errors, of errors in the mean flux estimate, and 
of contamination from damping wings. We flnd that these 
systematics are likely to be less severe for upcoming 3D Lya 
forest analyses than they were for past line-of-sight studies. 

We assume a flat ACDM cosmological model where nec- 
essary with = 0.27, h = . 71, a s = 0.8, = 0.96, 
and = 0.046 ([Komatsu et all [201 ll ). We henceforth wiU 
use "Mpc" as shorthand for "comoving Mpc," and we use 
the standard Fourier convention for cosmological studies in 
which 2 7r's appear under the dk's. Table 1 provides relevant 
numbers for the Lya forest that are used in our calculations. 



2 SENSITIVITY TO 3D FLUX POWER 
SPECTRUM 

2.1 Covariance 

Let us take a quasar survey that provides A'^ Lya forest 
spectra at redshift z. It provides these Lya forest spectra 
towards the locations xj_^n on the sky, where n G {1, A^}. 
We treat the line-of-sight direction as continuous for conve- 



Table 1. Relevant numbers and conversion factors: x is the con- 
formal distance. Ax is the conformal distance covered by the Lya 
forest that is not contaminated by the Ly/3 forest, and b is the 
large-scale bias of the Lya forest in our m odel. The mean flux, 
{F), is calculated using the fitting formula in lMeiksinl l l2009l') . and 
dX/dx and dv/dx are in units of A Mpc~^ and km Mpc~^, 
respectively. 

z X [A] (F) Ax [Mpc] dX/dx dv/dx \b\ 



2.0 


3647 


0.88 


782 


0.82 


67 


0.12 


2.5 


4255 


0.80 


740 


1.01 


71 


0.18 


3.0 


4863 


0.70 


701 


1.22 


75 


0.27 


3.5 


5471 


0.58 


666 


1.45 


79 


0.37 


4.0 


6078 


0.44 


635 


1.69 


83 


0.55 



nience such that the survey measures the Lya forest in a 
spatial window given by 

JV 

W(x) = - ' (1) 

n = l 

where n = N/A, A is the survey area on the sky, and 
X = (a;||,a;j_). We use an analogous convention for the 
Fourier wavevector, fc. Thus, the survey measures (5(a;) = 
5f(x) Wix), where 5p(cc) = Fj (F) — 1 is the overdensity in 
the transmission fraction F , where {F) is the mean trans- 
mission. Switching to the Fourier basis, the covariance of 
two (5-modes is 



(5(fc)5(fc')*) = n"' (27r) &°{k\\ - fc||) 

JV JV 



(2) 



(2^) 



Pp(fc||,fcj 



where (...) represents an ensemble average, Pn.ti is the ID 
noise power for skewer n (which we assume does not cor- 
relate with other sightlines), 5™ is the Kronecker delta, 
tildes signify the Fourier basis, and w„(fc||) is the weight 
given to the mode for skewer n and normalized such that 
X]„ '"'"('^11 ) ~ 1- We will often keep the k\\ depen- 
dence of vjn implicit for notational simplicity. This weighting 
scheme assumes that the combined weight of pixels n and 
m factorizes. While not fully general, with the proper choice 
of Wn, this scheme is close to optimal (Section [3)). Lastly, 
equation ((2)1 approximates the modes as continuous in the 
transverse direction, as would occur in the limit A <x. 

The noise power that appears in equation ^ can be 
related to the [S/N]x on the continuum in a pixel in a res- 
olution element of size X: 

Ax 



PN,n = {F}-'[s/Nr^i 



IMpc 



= 0.8(F)-[S/iV]- (^Ui±: 



-3/2 



(3) 
(4) 



where AA is the pixel size in wavelength, and we have as- 
sumed that the noise is white. We will henceforth discuss the 
noise in terms of the S/N in 1 A pixels. However, for the tens 
of Mpc modes of interest for 3D Lya forest analyses, resolv- 
ing 1 A may not be required. A resolution of tens of A suffices 
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to capture the scales where 3D correlations can be detecte d 
for upcoming measurements l|McDonald fc Eisensteinll200'i1 ). 
It is simply a convenience that leads us to quote the S/N 
per 1 A pixel. 

Equation ((Sjl is the covariance of the Fourier-space flux 
overdensity field, 5f, convolved with the survey window, W . 
In the limit of large A'^, W becomes a 5-function such that 
5 « 5p. The covariance of two modes in the flux field can be 
approximated in this limit as diagonal with entrie^ 



Ptot = (141') = PF(fe) + n"' [Pn + Plos(fc||)] 



(5) 



where X indicates the average of quantity X„ over A'' skew- 
ers. 



-Plos(fc|| 



and 



i 



(2^ 



El 



ft(fc||,fex), 



(6) 



(7) 



The function Pios is just the line-of-sight power spec- 
trum that is typically me asured in cosmologica l stud- 
ies of the Lyg forest ( e .g., McDonald et al.l |2005| ) . As in 
iMcDonald fc Eisenstei 3 (|2007i r we will refer to this term in 
equation ([6]) as the "aliasing" term. Equation ((5| is derived 
by noting that the sum over A'' skewers is a monte-carlo 
evaluation of the integral, and the monte-carlo integral of 
a plane wave leads to a (S-function to precision of N~^^^. 
Replacing these summations with 5-functions results in a 
two 5-function term from the components that have m 7^ n 
(that yields Pp) and a one 5-function term from m — n (that 
yields Ptot — Pf). Equat i on ([5| agrees with equation (13) in 
McDonald fc Eise nsteinI (|2003) and with equation (A6) in 
White ct al. (2010,) 

Henceforth we ignore correlations between flux pixels in 
a single line-of-sight, which contain a small fraction of the 
total information. This omission is equivalent to enforcing 
the condition that m 7^ n in the summation in equation ([2]). 
We deflne the estimator Pp = |5fe|^ for this case. Dropping 
terms suppressed by higher powers of A*', the ensemble av- 
erage of Pp is 



(Pp(fe)) =Pp(fc). 



(8) 



Thus, omitting line-of-sight correlations results in an unbi- 
ased estimator at lowest order. While Pp is biased to frac- 
tional order N~^^'^ , the noise in this estimate can be cor- 
rected because the known as shown in Section O 



^ The largest of the off-diagonal terms is Pp VARfui^/Af], where 
the "VAR" function represents a number whose probabiUty dis- 
tribution function has a variance given by the argument. Its exact 
value is determined by the x±^n. Even though there are A^^ pos- 
sible correlations, the error from neglecting the off-diagonal terms 
scales as N~^^'^ rather than because there are only quasar 
locations. 

As a consistency check, it is simple to show that (x^i^w)) = 
Pp /Ptot is ^ A'^, where the sum is over all wavevectors with 
the same fcy. This incquaUty demonstrates that the amount of 
information that can be extracted from 5p (as measured by the 
significance at which 5p can be detected) is less than or equal to 
the number of Lyo forest pixels in a survey. 



An additional advantage of omitting line-of-sight correla- 
tions is that the contamination from continuum fluctuations 
and damping wings primarily induce this type of correlation 
(Section [5]). 

The covariance of Pp(fe) is 



COv[PF(fe), Pp(fe') 



" ' (2^)4 ^ 



+ e 



'a!fc-{fe'^+fe^') 



X w^wt [PF(fc||,fcW) + PN,„<5i(2^)2 5^(fc(_^))] 

X t&fc™;; [Pp(fc||,fcf ) + PN,„<5^(27r)'5^(fcf )] , (9) 

where all 3D wavevectors that appear are evaluated at fcy, 
and we have assumed Gaussianity and, thus, dropped the 
connected 4th moment. While the small-scale modes in the 
forest are far from being Gaussian, Gaussianity is likely to be 
a decent approximation at the > 10 Mpc wavelength modes 
of interest for 3D analyses. With a bit of algebra analogous 
to how equation ((5| was derived, this equation simplifies to 

COv[Pp(fc),Pp(fe')]fc|| = ^PtltSk + ^^PF(fc)PF(fc') (10) 

+ / PF(fcn, feV")Pp(fcn, fe^" - fex - k'j_) 



+ ^u,2 ^-1 J^_^Pp(fcjj^feW)J'p(fcii,feV'' +fc-L -feV)- 

For the wavevectors and quasar number densities of in- 
terest, the off-diagonal terms in equation pO|l are unim- 
portant. The subtlety here is that, unlike the diagonal 
terms, these terms do not average down linearly with the 
number of independent modes when binning the estimates 
for Pp in a shell in fc-space. This is analogous to the 
well-known behavior of the 4-point function in surveys of 
large-scale structure (|Meiksin fc Whit e 1999), which has 
become known as "bea t-coupling" (|Hamilto n et al.l I2OO6I : 
I Rimes fc Hamiltonl bOOe''). The most important off-diagonal 
contribution comes from 4Ar-ipF(fc)PF(A;'). At fixed 
this becomes comparable to the corresponding diagonal term 
when the number of binned pixels in the shell is comparable 
to N. This criterion is satisfied for larger wavevectors than 
fcx = 0.06[n/10~3Mpc-2]^''2 x y/k±/Ak± Mpc'S where 
Ak± is the size of the bin in the k± direction. However, 
for the wavevectors and number densities of relevance, we 
find that in practice this term is always subdominant to the 
aliasing term, Pp, in Ptot. The inclusion of these terms 
is however formally important in that they serve to "cap" 
the total information content of a survey, resolving an issue 
raised in McDonald fc Eiscnstoin (2007). The off-diagonal 
elements are a more important consideration when line-of- 
sight correlations are included. 

Quasar clustering was not accounted for in the previous 
expressions. While clustering does not bias Pf when line-of- 
sight correlations are omitted (at least at quadratic order in 
the density), it does increase the variance of the estimate. 
Clustering results in the term (Pn + Pios) in Ptot gaining 
the factor 1 -I- Cq{k±) n, where Cq{k±) is the angular power 
spectrum of the sources that contribute at that redshift. 
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Table 2. Sloan Dig ital Sky Surve y constraints on P]os in units of 
Mpc from McDonal d et alj l|2005l 'l. 





z = 2.2 


z = 2.6 


z = 3.0 


z = 3.6 


z = 4.0 


0.15 


0.27(2) 


0.43(2) 


0.58(4) 


1.05(10) 


1.47(22) 


0.20 


0.26(1) 


0.38(2) 


0.58(3) 


0.86(6) 


1.08(13) 


0.30 


0.18(1) 


0.30(1) 


0.44(2) 


0.81(5) 


0.85(10) 


0.50 


0.15(1) 


0.24(1) 


0.35(1) 


0.59(3) 


0.81(07) 



Table 3. Value of Vn = (l + f'N,n/fios)~^ (cf-, eqn. ll3l l at several 
redshifts as a function of the S/N on t he continuum in a 1 A pi xel. 
This calculation uses the estimate of iMcDonald et al.l ||2005| ) at 
k = 0.0014 s km-i for P|„,. 



[S/N]iA 


z = 2.2 


z = 2.6 


z = 3.0 


z = 3.6 


2 = 4.0 


0.50 


0.04 


0.06 


0.08 


0.10 


0.11 


1.00 


0.14 


0.20 


0.27 


0.31 


0.33 


2.00 


0.40 


0.50 


0.59 


0.64 


0.66 


5.00 


0.81 


0.86 


0.90 


0.92 


0.93 


10.00 


0.94 


0.96 


0.97 


0.98 


0.98 



This correction is significant on scales where the clustering 
power is comparable or larger than the shot power. Assum- 
ing a quasar bias of 3 at z = 3 and that each Lya spectrum 
yields 500 Mpc of absorption, the two become comparable 
for ft ^ 10"^ Mpc"^ (and then only at kj_ 0.02 Mpc"\ 
the horizon scale at matter-radiation equality). This n is an 
order of magnitude higher than what upcoming surveys will 
achieve. 



2.2 Optimal Weights 

The weight that maximizes the signal-to-noise ratio, 



P|/var[Pp] 



™„(fc||) = (Plos(fc||)+PN,n)" 



(11) 



where S(fc||) is fixed by our normalization convention and 
absorbs all factors that do not depend on n, and this maxi- 
mization neglects the subdominant off-diagonal covariances 



so that var[PF] = 2 Ptot. (We show in Section[2]4]that a sim- 
ple generalization of these weights can be easily applied to 
real data.) This choice of weights results in Ptot becoming 



Ptot(fe) = Pp(fe) + nj PioB(fc||), 



where 



"off 



1 

A 



Pios{k\i 



-Plos(fc|| 



N.n 



(12) 



(13) 



Thus, a single number, Uas, characterizes the sensitivity of 
a Lya forest survey to Pp, and Vn is a measure of the impor- 
tance of each quasar on a scale of to 1. While fiefi depends 
on fcy, in practice this dependence is likely to be weak be- 
cause P\oB{k\\) has roughly a white noise power spectrum at 
relevant wavevectors and Wcff also depends relatively weakly 
on Pios(fc||). The constancy of Pios at k ^ 0.5 Mpc~^ is quan- 
tified in Table [21 which tabulates measurements of Pios from 
iMcDonald erahl l|2005h at several fcy and redshifts. Con- 
stancy should be an even better approximation at smaller 



Table 4. ncfp for a quasar survey in units of 10~^ Mpc~^ assum- 
ing that rrtj^g = and fc = 0.1 Mpc~^ . These n umbers are cal- 
culatcd using the B-band luminosity function from lHopkins et al.l 
(2006). The second number in select entries is this but also us- 
ing galaxies in addition to quasars. The bottom row provides the 
factor that converts n^ff in each respective column to units of 
deg~^. 



1 A 


2 = 2 


2 = 2.5 


2 = 3 


2 = 4 


2 = 5 


21 


0.26 


0.16 


0.07 


0.01 


0.00 


22 


0.82 


0.54 


0.27 


0.04 


0.00 


23 


2.0 


1.4 


0.78 


0.11 


0.01 


24 


3.8/5 


2.8 


1.7 


0.32 


0.04 


25 


6.4/18 


4.8/9 


3.2/5 


0.77 


0.10 


26 


9.8/66 


7.4/47 


5.1/28 


1.5/4 


0.24 




8.4 


11 


13 


16 


19 



0.010 
0.005 



I 



Ph 0.001 

S 5x10-4 



IK 1x10-4 
5 X 10-5 



20 




m 



lA 
AB 



Figure 1. Solid curves show the effective number density of 
quasars contributing Lya forest spectra at redshift z (eqn. I13|l as 
a function of the B-band AB magnitude that has [S/N]ia = 1, 
"^AB- have also assumed that m^g is equal to the limiting 
magnitude of the survey. The dashed curves are the actual num- 
ber of quasars brighter than rr iAB- These curves ar e calculated 
for fc|| = 0.1 Mpc~^ using the iHopkins et al. luminosity 
function and assuming that S/N ocflux. 



fc|| than is tabulated. For a BOSS-like Lya forest survey 
at 2 = 2.5, a factor of 2 smaller Pios(fc||) from its small-fc|| 
asymptote results in only a factor of 1.4 decrease in notf. 
The decrease is even smaller for a deeper survey. 

The gains in sensitivity to Pp are meager from improv- 
ing the S/N on a quasar once Pn.u < Pios, which corre- 
sponds roughly to [S/N]ia ~ 2. Table |3]quantifies this state- 
ment by giving Un as a function of the S/N at 1 A, using the 
estimates of Pios at the smallest k quoted in [McDonald et al] 
(2005) of 0.0014 s km~^. However, these numbers should be 
applicable over a wide range of k owing to the form of Pios- 
Even though the amount of power in the forest increases 
with redshift, the S/N requirements at fixed source flux 
remain nearly constant with increasing redshift (becoming 
slightly less stringent). 

Figure[T]shows fies at three redshifts as a function of the 
B-band magnitude at which a survey obtains [S/N]-i_a = 1. 
For this and subsequent calculations, we assume that Pp has 
the simple form 

PF{k,z) = b{zf{l + gfiYPt{k,z) exphfcj|/fc^], (14) 
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where P]'" is the hnear-theory density power spectrum and 



fcn = mp/[k-BT\. We assume T = 20,000 K isothermal gas 
such that fco = .08 s~^ km, and is cahbrated so that 
Pios matches the iMcDonald et^ (|2005l ) measurements at 
0.0014 s"^ km (see Table 1). The factor [l + g owes to 
peculiar velocities, where /j, = n ■ k and n is the unit vector 
along the lin e-of-sight. set q = 1 , a choice motiv ated by 
the results of lSlosar et"all (12009') and McQuinn ot al. ('2010|), 
but caution that there is at pr esent theoretical unce rtainty 
in this choice at the 50% level (iMcQuinn et al.ll2O10h . 

The solid curves in Figure [T] are calculated using the 
iHopkins et all (|2006l ) B-band luminosity function and as- 
suming that each quasar contributes ~ 500 Mpc of Lya 
absorption (specifically the forest between 1041 and 1185 A 
in each sightline). Thus, these curves are the effective num- 
ber density of quasars projected over this distance, which 
can be thought of as the number of quasars that contribute 
Lya forest spectra at the stated redshift. These curves also 
take the survey limiting magnitude ttiab to be equal to the 
magnitude where quasars have [S/N]ia ~ 1, "Iabi choice 
we discuss below, and assume that S/N ocflux. This scaling 
corresponds to the sensitivity being sky- or dark current- 
limited, a choice motivated by the fact that this scaling mat- 
ters most for the faintest objects that are observed. However, 
the value of fieff is not significantly changed if rather we as- 
sume S/N oc fiux^/^ as would occur if the observations were 
photon-limited. The solid curves from top to bottom are fica 
for z — 2, 3, and 4. These curves illustrate that it will be 
challenging to (1) measure the 3D Lya forest at z > 3 be- 
cause of the falloff in the total abundance of quasars or (2) 
obtain ficB 2> 10""^ Mpc~^ at any redshift because of the 
shallow faint-end slope of the luminosity function. Table 3] 
tabulates fica as various redshifts and limiting magnitudes, 
and it also gives the conversion from Mpc~^ to deg~^ units. 

The dashed curves in Figure [1] show the total number 
of quasars brighter than mAB- The effective number density 
at m^'B is always a factor of a few smaller than the total 
number of quasars, with the difference decreasing with red- 
shift. See Appendix!^ for an analytic understanding of this 
suppression for power-law luminosity functions. 

BOSS will achieve a B-band magnitude limit of m^B ~ 
22 at 2; = 2 (m^B ~ 21 at z = 3) and also achieve « 
22. These numbers yield n^s = (0.3 - 0.8) x 10"^ Mpc"^ 
[4 — 7 deg~^] at z — 2 — 3. BigBOSS aims to achieve one 
magnitude fainter than BOSS, which results in n^ff = (1 — 
2) X 10"^ Mpc"^ at 2: = 2 - 3. For these Ucft and at A: > 
0.1 Mpc~^, S/N scales linearly with ficft such that BigBOSS 
will be a few times more sensitive than BOSS. 

Figure [2] is a contour plot of fics as a function of m^B 
and jtia'b- The solid contours are calculated assuming fcy = 
0.1 Mpc"^ and are labeled in units of 10"^ Mpc"^. These 
curves illustrate that is maximized roughly when m^a ~ 
"T-AB —0.5. Little is gained by observing fainter quasars than 
rriAB —0.5 or by integrating longer on the same quasars once 
rriAB ~ ttiab + 0.5. The dashed contours are the same but at 
0.5 Mpc~^ (at which Pios is a factor of 0.6 smaller in 



k 



our model), demonstrating that the fcy dependence of these 
conclusions is weak. 

How much is gained by weighting by Wn relative to a 
uniform weighting scheme? Let us take a luminosity with 
power-law index —2, a form discussed in Appendix [X] We 
assume that S/N oc flux, PN.iim/Pios ~ 1 (equivalent to 
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Figure 2. Impact of varying the limiting magnitude of tiie survey, 
mj^g, along with the magnitude at which IS/N]ia = 1, ™ab' 
The solid curves are contours of constant fieff evaluated at fcy = 
0.1 Mpc~i and z = 2.5, with the labels in units of 10~^ Mpc"^. 
The corresponding dashed curves are the same but evaluated at 
fcii = 0.5 Mpc" 
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Figure 3. 3D hya forest effective volume divided by survey 
volume at z = 3, and for n^ff = 10~^, 10~^, 10~^, and 
10~* Mpc~-^ in order of increasing dash length (decreasing am- 
plitude). The plotted quantity is equal to (2Pp^/5Pp^)^ and is 
essentially independent of redshift at the plotted scales. 



[S/N]iA ~ 2; Table O, where PN,iim is the noise power 
spectrum for quasars at the survey limiting magnitude. 
This case results in only a 4% improvement in the value 
of Ptot — Pp relative to a simple uniform weighting. For 
PN,iim/Pios = 0.5(0.3), the improvement is 40% (60%). 
Thus, weighting only offers a significant improvement when 
a large fraction of the quasars have Pn.h/Pios < 1- 



2.3 Effective Volume 

One can define an effective volume for Lya forest sur- 
veys analogous to the effective volume in galaxy surveys 
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l|Feldman et al.|[l994l ). In particular, in the galaxy survey 
case the effective volume is 



Kff,g(fe) = V, 



which becomes in the Lya forest survey case 



V,s{k)=AL 



PF(fe) 



(15) 



(16) 



In equation (|15[) and (I16|) . V^g is the actual volume of the 
galaxy survey, Pg is the galaxy power spectrum, rig, 30 is the 
number density of galaxies, and L is the line-of-sight dimen- 
sion of the Lya survey. The major difference between these 
two effective volumes is just that in the Lya case the shot 
noise is in the plane of the sky and this term is modulated 
by the line-of-sight power. 

Figure [3] plots a generalization of the the effective vol- 
ume, Veft^,,{k)/[AL], for ncff = 10"\ 10"^ 10"^ and 
10~* Mpc~^, in order of increasing dash length. We de- 
fine Kfr,^(fe) to be Voff(fc) but averaged over a shell in k- 
space such that Vca,^{k) = 2 AL ^/ 5P^ ^, where Pf,/j = 
Pf/(1 + 5M')', 



5Pf 



-1/2 



(17) 



and var[PF(fc)] = cov[PF(fc), PF(fc)]. Here, 5Pf,ii has been 
defined so that SPy.^/M^^'^ is the precision at which Pf,h 
can be measured in a fc-space shell with Nk independent 
modes. 

The value of Vcfi,i_i{k)/[AL\ is near unity when a survey 
is sample variance-limited. For a BOSS-like quasar survey 
in which ficff — 10~^ Mpc~^ at 2 < z < 3, this quan- 
tity reaches a maximum of 0.5 at the smallest wavevec- 
tors and falls off rapidly at > 0.1 Mpc~^. Even n^g = 
10"^ Mpc"^ - roughly the maximum density that can be 
achieve in a quasar survey - is not sample variance-limited 
at fc > 0.1 Mpc~^. Figure [3] also emphasizes that n^g « 
10""^ Mpc~^ marks the critical number density at which a 
survey becomes sample variance- limited at fc < 0.1 Mpc~^, 
such that the gains from a 3D analysis become large. 

The plotted quantity in Figure [3] can also be related 
to the S/N of quasar survey to Pf,/j in a fc-space shell of 
width Afc. This shell contains Mu = k^ Ak A L/{2tv'^) inde- 
pendent samples, so that the S/N ratio at which Pf.^ can 
be measured in this shell equals 



Pf,m/o"f,m = 60 



10-2 
1.2 



1/2 



A 



IGpc 103 deg^ 



jf_\Ak/k\_ 
10-3 Mpc- 



1/2 



1/2 



(18) 



where crp^^ = 5Pp^f^{k)/J\fl^^ and the redshift dependence is 
approximate. 



2.4 Estimating Correlations on Real Data 

In practice, it will be easier to measure 3D Lya correlations 
in configuration space via the 3D correlation function. In 
configuration space, the Fourier-space weights proposed in 
Section 12.21 become the line-of-sight convolution of w„ (r) 



- the Fourier Transform of ■Wn{k^^) - with the flux field of 
sightline n. Conveniently, Wn{r) will be relatively localized 
in real-space because w„(fc||) ~ Wn{0) for fcy < 0.5 Mpc~^. 
However, for estimates of the correlation function, a simpler 
weighting for each sightline of Wn(0) is likely sufficient. In 
fact, this weighting is identical to the full weighting to the 
extent that Pos and the PN,n are white. We find that var[PF] 
is not significantly increased with these simpler weights. 

Real data will be more complicated than the idealized 
case of uniform noise that we have considered thus far. 
For example, there will be high-noise regions in the spec- 
tra owing to sky lines, and there may also be holes in the 
data where damped systems have been excised. We have ex- 
pressed our weights in terms of the noise and the Lya forest 
power spectra. However, the suggested weights are equiva- 
lent to simply weighting each sightline by (1 -I- o"n/''"ios)~^i 
where and af^^ are respectively the variance of the noise 
and the Lya forest, smoothed on a large enough scale that 
Pos is white. This scale corresponds to > 10 Mpc, and we 
find that the exact choice of the smoothing scale weakly af- 
fects var[PF]. Such weights can more readily be applied to 
real data. 



3 QUADRATIC ESTIMATOR 

This section derives the minimum variance estimator that 
is quadratic in 5p and compares it to the estimator derived 
in the Section [2] To proceed, we decompose the covariance 
of the flux overdensity at fcy into a component that depends 
on the parameter we aim to measure, Pp{k), and one that 
does not. Namely, 



Cnm 4* Pf (^) ^n-n 



(19) 



where Sf^„ is the Fourier transform of the flux along sightline 
n and we consider a single fcii , 



= P{k\\,rr, 



Afc, 



2tv 
dk± 
'2^ 



exp[ifcx • rnm], 

fc± PF(fc||, fcx) Jo(fc± rn 



(20) 
(21) 

(22) 



P(fc|| , Vjirn, ) ^ 

Here, r^m = l^'nml is the transverse separation between 
sightlines n and m, Afcj_ is the width of the Fourier space 
pixel in which we estimate Jt(fc), and equation (|22|l assumes 
that Pp(fe)X„„ < P(fc||, 

The minimum variance estimator for PF{k) that is 
quadratic in 5F,n and unbiased is given by 



pQE 
^F 



inm OF,nOF,m 



•tr[QC], 



(23) 



where 

Q= (tr[C"^XC"^X])"^C"^XC"\ (24) 
Equation (|24p is derived by minimizing the variance of 



Pp^ assuming Sf is Gaussian so that 



[P«-] 



2tr [CQCQ], 



(25) 



and subject to the condition that it is unbiased so that 
tr[QX] = 1. Note that equation (|25p does not include sam- 
ple variance, which would contribute the term 2 Pp. 
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Our aim is to understand how the weights suggested 
in the previous section relate to the above minimum vari- 
ance quadratic estimator. To proceed, we set the off diagonal 
terms in Cnm to zero (i.e., P{k\\,rnm) — > for r„m / 0). 
The estimator is still unbiased with this approximation. The 
motivation for this approximation is that the typical sepa- 
ration of quasars in such surveys is r±_ > 10 Mpc. At these 
r_L and at relevant fey, P(fc||,rx) is down by a factor of > 4 
from its r_L = value. In this "zeroth-order" approximation, 
Qnm becomes 



Si nm 

where Pios 



/ 2-K 



exp[ik±_ ■ r„rn] 



(Plos + fN,n) (Plos + Pn 



(26) 



P(fc||,0) and 



A = (X^(Pio. + PN,n)"' (^'ios + Pn,™)"' , (27) 

^^(Plos + PN.n)"') . (28) 



The last line assumes that the noise is uncorrelated between 
sightlines and neglects the terms with m — n, whose frac- 
tional contribution vanish as A'^ — oo. The final simplifica- 
tions makes it apparent that this weighting is identical to 
that derived in Section 12.21 where we had implicitly made 
the same assumptions. The estimator in Section E^ is biased 
at the N~^^^ level. Subtracting the term tr[Q(°'C] from 
X],im Qnm (Jp.n^F.m in equatiou (|23|) corrects for this bias. 

Ignoring the tr[Q''''C] correction, the corresponding es- 
timator for measuring the correlation function ^(r) follows 
from taking the Fourier transform of Q^m 5F,nSF,m and, with 
the approximation in equation 



ance of Pp^ and that of the minimum variance quadratic es- 
timator Pp^, both calculated using equation (|25p but with 
the appropriate Q. The top panel fixes fcy at 0.1 Mpc~^ and 
varies k±, and the bottom panel sets fcy = k±. The solid 
curves in the top panel show the fractional difference in the 
zero- noise limit (Pn.ti = 0), which is the case where the 
fractional difference is largest. In order of increasing thick- 
ness, the solid curves represent n — 10""^, 3 x 10""^ and 
10~^ Mpc"'^. At the smallest k± shown in the top panel, 
the estimator variance for P^"' is only 6% larger for the 
case with n = 10~^ Mpc~^ (thin solid curve). This differ- 
ence becomes 10% and 20% for n = 3 x 10"^ Mpc"^ and 
10~^ Mpc~^, respectively (thicker solid curves), and it in- 
creases with k±. These estimates for the fractional increase 
in variance do not include sample variance, which would fur- 
ther decrease the plotted fraction (especially for the highest 
n and smallest k). 

The red dashed curves in the top panel of Figure [4] are 
the same as the black solid curves except that they include 
noise. In particular, these curves assume a power-law lumi- 
nosity function with slope —2, z — 2.5, Pn.u ~ 1 Mpc~^ 
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where ★ signifies a convolution along the line-of-sight. This 
estimator is identical to that derived in a different manner 
in Section \TM 

FigureUlplots the fractional difference between the vari- 



Figure 4. Top panel: Fractional increase in variance for the Pp"' 
estimator relative to the minimum variance quadratic estimator, 
Pp^, as a function of kj_. In order of increasing thickness, the 
curves are for n = 10~^ , 3 X 10"'^, and 10~^ Mpc~^. The black 
solid curves are the noiseless case, and the red dashed curves 
include noise as described in the text. These calculations assume 
fc|| =0.1 Mpc"'^, are performed in a 200 Mpc X 200 Mpc region, 
and do not include sample variance. Bottom Panel: The blue dot- 
dashed curves are the same as the red dashed curves in the top 
panel but with fcy = fcj^. 



where quasar n is at the limiting magnitude of the survey, 
and S/N oc flux. These choices result in fieff being reduced 
by a factor of 0.7 compared to the noiseless case, and the 
red dashed curves are suppressed relative to the black solid 
curves by a comparable factor. We find a similar correspon- 
dence for other noise models. 

One can improve upon our crude diagonal approxima- 
tion for C in an iterative manner. In particular, maintain- 
ing only the diagonal elements in C when inverting can be 
thought of as the lowest order approximation for C^^ in 
equation (|24|l . There are iterative methods that can be ap- 
plied to achieve higher order corrections (such as the Jacobi 
method, Neumann iteration, or the Gauss-Seidel method). 
We advocate for the Gauss-Seidel method here because it is 
guaranteed to converge since C is Hermitian, and we have 
indeed verified that it converges. This method yields after 
the i*'^ iteration the estimate for the inverse of C given by 

7-[7[Cgs^)]-^), (30) 

where L and U are the lower and strictly upper diagonal 
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Figure 5. n^ff as a function of for a Lya forest survey 

that uses either galaxies or quasars. These curves assume mj^g = 
'"ab' where mJ^B survey hmiting magnitude. The soUd 

curves show n^f( for quasars at z = 2,3, and 4 (from top to 
bottom). The dashed curves show this quantity for galaxies at 
^ = 2,3, and 4 (from left to right). Here, n^ff is calculated at 
fcy = 0.1 Mpc~^, but its fc|| dependence is extremely weak. 



left to right), assuming that m^B = wT'Ab- These curves 
are calcula ted from the luminosit y function of Lyman-break 
galaxies in iBouwens et al. I (|2007l ) for 2^3, assuming that 
there is no evolution i n the ultraviolet lumin osity function 
between z = 2 and 3 (iReddy fc Steidell 120091 '). The galaxy 
magnitudes are for rest-frame 1600 A. The solid curves are 
the same but for a quasar survey at these redshifts. Galaxies 
begin to aid the sensitivity when ttiab ~ 24 at z = 2, when 
m]^ « 25 at z = 3, and when m^'e ~ 26 at z = 4. Including 
galaxies results in a dramatic increase in fics once a survey 
pushes below these stringent magnitude thresholds (Table 
Ij. 

The non-smooth continuum of galaxies' spectra will add 
an additional source of noise, but will not bias the estimate 
of Pp as long as correlations within a single sightline are 
neglected and the mean continuum can be removed (and 
even these mistakes can be isolated; Section ??). If the power 
in galaxies' continuum spectra is smaller than n/fias x -Pios, 
using galaxies in addition to quasars has the potential to 
significantly improve a survey's sensitivity. 



components of cQ We initialize the iteration with Cgg = 
Cnm i5™ SO that the zeroth order iteration yields Q'"' (eqn. 
I26p and the associated estimator, Pp"'. 

The first order Q down-weights sightlines that fall near 
one another. It is instructive to write the first order result 
using the simpler Jacobi iteration method. In the Jacobi 
method, the first order approximation for the is 

^ \cfr^{2Cf - C) [CfY'- (31) 

Being cavalier with the normalization, the next order 
weights in the Jacobi iteration scheme are given by 

Oj!L = WnWr.e'-^ '^- [l~Sn){l-S'^), (32) 

where 

Siik) = V — exp[-ifex -rij]. (33) 

The average of Si in a survey equals PF(fc)/[n^Pios]. If the 
number of quasars within |fcx • fijl < 1 of quasar i is larger 
than for other quasars, Si will also be larger, suppressing 
the weight given to this quasar. However, if all quasars have 
a similar number of quasars within \k± ■ rij\ < 1, then the 
Si would not vary strongly among the sightlines and this 
correction would be of minimal importance. 



4 SURVEY CONSIDERATIONS 
4.1 Galaxies 

The apparent flatness of the faint end of the quasar luminos- 
ity function does not facilitate using observations of quasars 
with L < L* to obtain a denser sample of Lya forest spec- 
tra. However, galaxies can be used to supplement in very 
deep surveys. The dashed curves in Figure [5] represent n^a 
for a spectroscopic galaxy survey a.t z — 2, 3, and 4 (from 

^ The inversion of the lower diagonal matrix L requires at most 
A''^ operations. 



4.2 Cross Correlation 

It is also possible to cross-correlate Lya forest skewers with 
some other tracer of large-scale structure. This could be 
done using a galaxy survey or with the quasars in the 
Lya survey themselves. We express the overdensity field of 
these tracers as 5g and their average 3D number density 
as Wg,3D. The average cross power between these signals is 
Ppg(fc) = {S{k) 5g{k)) and the variance on an estimate of 
this signal is 

var[Ppg(fe)] = PFg(fc)2 + Ptot(fc) (Pg(fe) + n-Jn) , (34) 

where Ppg is the cross-power spectrum between 5f and 5g 
and Pg is the auto-power of S^. If we again assume the weight 
factorizes, weights given by equation (|lip also maximize the 
S/N in this case. 

There are three scenarios for which the cross power 
could be an interesting measurement: (1) the cross- 
correlation is more sensitive to P]"^ than the Lya forest 
auto-correlation, (2) it could be used to separate different 
contributions to the Lya flux power or to measure the bias 
of the galaxies, or (3) it could be a systematic check for both 
the Lya and the other survey. 

Cross correlation is more sensitive than the Lya forest 
auto-correlation when 

ng,3D > n^a P^Z^ {k\^) b'^ /b^, (35) 

where we have assumed that the shot noise and aliasing 
terms are the dominant sources of variance for both surveys. 
For ficff = 10"^ Mpc"^, b = 0.3, and Pos = 0.5 (where 
the latter two numbers are characteristic of the z = 3 Lya 
forest), condition (|35|l becomes rig.sD > 2 x 10~^ Mpc~^ 
for a tracer with bg = 3. In addition, at scales where the 
other survey is instead sample variance-limited, the S/N in 
cross correlation will be at least as large as in the Lya forest 
survey's auto-correlation. 

Inequality (I35|l requires a number density that is a fac- 
tor of ~ 10 higher than the 3D quasar density that BOSS 
aims to measure. However, even though the S/N in the cross 
correlation is then a factor of ~ 10 below the S/N in the Lya 
forest auto correlation, this should be still be sufficient to 
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measure the quasar bias (and to much better precision than 
is possible with the quasar auto-power). A more promising 
route may be to cross correlate with a separate high-redshift 
galaxy survey. As an example, the Hobby-Eberly Telescope 
Dark Energy Experiment (HETDEXfl aims to find spec- 
troscopically 1 million Lya emitters between 1.8 < z < 3.8 
over 200 deg^, which works out to an average density of 
J^g.SD « 2 X 10~* Mpc"'' and easily satisfies criterion (|35p 
when correlated with a BOSS-like Lya forest survey. 

If criterion (I35|l is satisfied, the S/N is greater than or 
equal for the galaxy survey's own auto-power compared to 
its cross power with the Lya forest survey. Thus, the cross 
power may be most interesting as a systematics check as well 
as to investigate the sources of fiuctuations that contribute 
to 5f. 



4.3 Survey Strategy 

Figure [S] illustrates the survey strategy tradeoffs in the sky 
coverage versus magnitude-limit plane for z = 2.5. The black 
solid curves in both panels are contours of constant sensitiv- 
ity to Pf for a quasar survey. The black long-dashed curves 
are for a survey with quasars plus galaxies and at the same 
sensitivity as the black solid curves. The framed labels asso- 
ciated with each solid curve show the S/N of a measurement 
of fp.fi in a fc-space shell of width Ak/k — 0.2 . This shell is 
centered on A; = 0.1 Mpc~^ (left panel) or on fc = 0.5 Mpc~^ 
(right panel). 

The red short-dashed curves are contours of constant 
M = Survey Area/(limiting flux)^. For fixed duration of the 
observing campaign on an instrument, is a constant that 
does not depend on the survey strategy (assuming that the 
limiting fiux decreases as exposure time on a field to the 1/2 
power). The curves in Figure |B] correspond to A4 of 10~^, 
10"\ 1, 10, and 100 times M for the BOSS survey, whose 
approximate specifications are represented by the filled cir- 
cle (m^-g = mi^ = 22, .4 = 8000 deg^). For the present 
specific ations of BigBOSS, Ai is ~ 7 times larger than for 
BOSS. iMcDonald et al.l (|2005l 'l suggested an extension of 
the WFMOS spectrosco pic galaxy survey to include quasars 
l|Glazebrook et al.ll2005l ) and would reach m^'g ^ 24.5. WF- 
MOS is a proposed 300 deg'^ survey on the 8 m Subaru tele- 
scope. This hypothetical quasar survey would be only mod- 
erately more sensitive than BOSS at the considered scales. 

The left panel in Figure |6] argues that the survey strat- 
egy for BOSS is close to optimal in the sense of minimizing 
var[PF] at fc = 0.1 Mpc~^: The sensitivity contours have 
a similar shape to the red short-dashed curve that inter- 
sects the BOSS point. Similar conclusions hold for the sen- 
sitivity at fc = 0.5 Mpc~^ (right panel. Fig. In general, 
the optimal strategy for a quasar survey corresponds to in- 
tegrating deep enough on each field (and having enough 
fibers/slits) to reach quasars with luminosities of ~ L^,, 
where L« is the characteristic luminosity in the quasar lumi- 
nosity function. While Figure |S] assumes that m^B = itt-ab, 
the vertical axis can also be thought of as a function of fics, 
noting that jtiab = "^ab = {22, 24, 26} corresponds to 
fieff = {0.5, 3, 7} Mpc~^ at z = 2.5 for a survey that only 
includes quasars (Table |4}. 



At the considered wavevectors and fixed M, the sen- 
sitivity to Pf depends little on a survey's depth, at least 
for mii'g < 23. This resuh owes to the shape of the bright 
end of the luminosity function. However, we favor the survey 
strategy that is deep enough to reach down to L» because 
deeper quasar surveys will be able to better handle system- 
atics (because they have higher S/N per mode) and they 
rely less on the noise decreasing as the square root of the 
numbe r of modes (which can be invalidated by non-Gaussian 
effects; iMeiksinfc Whitelll999l ). Technically, a shallower but 
wider survey strategy will be more sensitive at smaller k 
than shown here because it samples more modes. 

The sensitivity gains of a deep survey that uses the spec- 
tra from galaxies in addition to quasars are not always sig- 
nificant. Compare the long-dashed and solid curves in Figure 
[6] The denser sample of sightlines that galaxies provide is 
more helpful at fc = 0.5 Mpc~^ compared to fc = 0.1 Mpc~^ 
(Fig. ini, but the sensitivity is never significantly improved 
over a quasar survey covering a larger fraction of the sky 
(fixing A4). Thus, the small gains in sensitivity from includ- 
ing galaxies may not be sufficient to outweigh the added 
difficulty of removing the galaxies' more complex continua. 

It is much more difficult to measure Pf at z = 4. How- 
ever, some of the science drivers of a measurement ai z — 4 
may not require a precision measurement, and even an or- 
der unity measuremen t at this redshift would be interesting 
( M cQuinn et al.ll2010l ') . Figure [7] quantifies the prospects for 
a measurement of Pf at z = 4. A spectroscopic quasar sur- 
vey reaching m}^ = 22 (the same depth as BOSS) would be 
able to achieve S/N « 3 at fc = 0.1 Mpc~^ in shells of width 
Ak/k — 0.2. A survey reaching mj^ — 23 would achieve 
S/N ^ 10. 

Thus far we have not discussed how to select quasars as 
a function of redshift: A survey with a multi-object spectro- 
graph would in practice have to decide whether to obtain the 
spectra of a quasar at z = 2 or z = 3, and this choice would 
affect its sensitivity. In the aliasing noise- limited regime, the 
total [S/N]'^ on a scale is oc J dzn^g. Because this integral is 
quadratic in Ucs, maximizing over a limited interval in 
redshift also maximizes the S/N. However, once sample vari- 
ance limits a survey at the scales of interest, it makes sense 
to broaden the redshift width of the quasar selection func- 
tion. In practice, a small fraction of slits or fibers per degree 
compared to the total number used in modern spectroscopic 
galaxy surveys is required to select all 2 > 2 quasars down to 
a survey's limiting flux (« 10 — 100). Therefore, the decision 
of which quasars to target may often be moot. 



4.4 Cosmological and Astrophysical Constraints 

Ultimately, one wants to use 3D Lya forest measurements 
to constrain cosmological parameters, ionizing background 
models, and the reionization history. This section briefly 
discusses how well these quantities can be constrained. 
To do so, we calcul ate the Fisher matrix deflned as (e.g., 
iTegmark eral]|l99iD 



dXidXi 



1^ 2 dP^k) dPHk) 



|http : / /www . as ■ utexas . edu/hetdex/ | 



(36) 

where C is the likelihood of the model given the data, the 
second equality assumes Gaussianity, and the \i are the pa- 
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Figure 6. Sensitivity to Pp at fc = 0.1 Mpc ^ (left panel) and k = 0.5 Mpc ^ (rigiit panel) as a function of survey area and limiting 
magnitude, where we have assumed that mj^™ = m)^, L oi 500 Mpc, and z = 2.5. The black solid curves are contours of constant 
sensitivity for a quasar survey and the partially overlapping black long-dashed curves are at the same sensitivity but for a survey that 
also includes galaxies down to the same magnitude. The framed labels associated with each solid curve quote the S/N on Pp.n in bins of 
width Ak/k = 0.2. The red circle represents the approximate specifications of the BOSS quasar survey. The red short-dashed curves are 
contours of constant A4 = Survey Area/ (limiting flux)^. For fixed duration of the observing campaign on an instrument, M does not 
depend on the survey strategy. The red short-dashed curves correspond to 10~^, 10~^, 1, 10, and 100 times the M of the BOSS survey. 




1.0 1.5 2.0 2.5 3.0 3.5 4.0 

Logio Area [deg^] 

Figure 7. Same as Figure [6] but at 2 = 4 and for fc = 0.1 Mpc~^ 



rameters we want to constrain. The summation is over all 
independent modes. Given this parameter set, the forecasted 
la uncertainty on the parameter Xi is [F~^]^^'^ . 

We perform our calculations for a survey a,t z — 2.5 and 
with a base parameter set that is given by the amplitude 
of Pf, its tilt and running with pivot at fc = 0.1 Mpc~^, 
the redshift-space distortion parameter g, the angular di- 
ameter distance Da, and the Hubble expansion parameter 



H{z). The former four parameters we treat as nuisance pa- 
rameters. Finally, we omit the first three line-of-sight modes 
since these are likely to be contamina ted by contintium an d 
mean flux errors, as motivated in iMcDonald et al.l l|2005h . 
The results depend weakly on this assumption. 

Some of the constraint from our base parameter set on 
Da{z) and H{z) owes not just to the baryon acoustic os- 
cillation (BAO) features, but also to Alcock-Paczynski type 
effects. Since it might be the case that the continuum un- 
der the BAO is contaminated by other effects, one may not 
want to use information that derives from this broad-band 
power. To guarantee that the constraint owes to the BAO 
features, we also provide a more conservative estimate in 
which we subtract from Pp a flux power spectrum that does 
not include the BAO prior to calculating dPp/dXi. In what 
follows, we quote both constraints. 

Figure |8] plots the constraints on Da{z) and H{z) at 
z = 2.5 as a function of fleff. The curves are for a survey 
with L = 1 Gpc and A = 10* deg'^. The thin curves repre- 
sent the optimistic predictions using the fiducial estimate for 
J^, and the thick curves are from the more conservative es- 
timate that subtracts the continuum underneath the BAO. 
A BOSS-like survey with A ^ lO" deg^ L ^ 1 Gpc, and 
fioff « 5 X 10~* Mpc~^ (n^AB = 22) can constrain Da at 
z = 2.5 to fractional precision of ~ 0.02 — 0.03, and one like 
BigBOSS where n^a « 1.5 x 10"^ Mpc'^ (m^l = 23) to 
a fractional precision of « 0.010 - 0.01513 The constraints 
we found on H{z) and Da{z) are comparable to those in 

^ The angular diameter distance is generally better constrained 
in surveys of large-scale structure for geometric reasons. However, 
the strength of redshift-space distortions in Lyman-a forest sur- 
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Figure 8. Fractional error on the angular diameter distance and 
Hubble expansion parameter at z = 2.5 as a function of ii^g. 
The curves are calculated for a survey with L = I Gpc and 
A = 10* deg^, but their amplitude scales as (L A)~^^^ ■ The thick 
curves represent a conservative estimate, and the thin curves rep- 
resent an optimistic estimate, as described in the text. 



iMcDonaldet al.l(|2005D . Figure [S] allows one to estimate the 
sensitivity of a survey of arbitrary volume since the error 
scales as [L A)~^^'^ ■ 

The precision at which Da(z) is measured from the 
Lya forest provides a constraint on cosmological parame- 
ters, such as the curvature space density, fifc. In particular, 
a measurement of Da{z) at z « 2.5 differenced with its 
value a,t z — 1100 - which is tightly constrained by cosmic 
microwave background measurements - provides a robust 
measure of Qk (at least under the assumption that dark 
energy is negligible at z > 2). If the principle uncertainty 
is in the angular diameter distance to the Lya forest, as 
is likely to be the case, this translates to the constraint 
Silk ~ 0.26 SDa/ Da at z = 2.5. Thus, a measurement of 
Qk to the precision ~ 3 x 10"'^ is possible with a BigBOSS- 
like survey, an improvement ove r the present-day lim it of 
l^jf I < 1% that assumes ACDM (|Komatsu et al.H201 j ). 

Ionizing background and temperature fluctuations can 
also be measured in a 3D Lya forest analysis. To gauge how 
well these contributions can be constrained, in addition to 
our base set of parameters, we also include a scale-invariant 
contribution to Pf that scales component that 

scales as and a white component that scales as k'^ (all 
spherically symmetric). The contribution that scales as 
is motivated by the expected scaling for intensity fluctu- 
ations at wavevectors larger than one over the mean free 
path of 1 Ry photons (essentially all accessible k a,t z — 2.5; 
iMcQuinn et al.ll201(t ). and the scale-invaria nt form approx- 
imate s some of the temperature models in iMcQuinn et al.l 
We have set the normalization of each component to 
be such that it is the same as in our simple model for Pf at 
(fc|| , A:^) = (0.1, 0) Mpc~^. This normalization is comparable 
to that of physically motivated mo dels for temperature an d 
intensity fluctuations presented in IMcQuinn et al.l l|2010l ). 
For a BOSS-like survey, the normalization of the k~^ com- 



veys enhances the line-of-sight power, boosting the sensitivity to 
H{z). 



ponent is constrained to 10%, the normalization of the k~^ 
component to 3%, and the normalization of the white com- 
ponent to 0.03%. We have also examined the constraints for 
a shallower survey with fioff = 10~* Mpc~^, L = I Gpc, 
and A — 10* deg^ (as could be obtained by BigBOSS if it 
targets z ~ 4). This case yields 40% and 12% constraints on 
the k~^ and k~^ components, respectively. 



5 SYSTEMATIC CONCERNS 

This section briefly discusses systematics that are known to 
be serious for line-of-sight Lya forest analyses in the context 
of 3D analyses. We focus on continuum fltting errors, mis- 
takes in the mean flux estimate, and the damping wings of 
high column-density absorption systems. A pervading theme 
of this discussion is the advantages of a 3D analysis in deal- 
ing with these systematics. 

Measurements of Ppik) must remove the structure in 
the quasar continuum in order to achieve an unbiased mea- 
surement of Sp. However, continuum fluctuations do not 
have to be removed perfectly since they only act as a source 
of noise as long as the error in subtr acting them doe s not 
correlate between different sightlines (|Viel et al]|2002h . (In 
addition, any systematic error introduced in this removal can 
be excised at a later step as described below.) The extra con- 
tinuum power that does not correlated between the different 
continua, Pio™*, would contribute an extra term in Ptot equal 
to n~^Pioa°*, where again var[PF] = 2Ptot, assuming that 
line-of-sight corr e lation s are omitted. At fcy > 0.1 Mpc^^, 
iMcDonald eFal] l|2005l l found that the mean quasar con- 
tinuum had power equal to < 1% of Pp. Thus, contin- 
uum fluctuations will not significantly increase the vari- 
ance (by increasing the amount of line-of-sight power) at 
fc|| > 0.1 Mpc~^, but could be of more importance at smaller 
wavevectors. 

The redshift evolution of the mean flux (P) - which 
can be quite substantial over L ~ 500 Mpc - can also con- 
tribute spurious power at small fcy . Accurately estimating 
(P) is known to be crucial for interpreting line-of-sight mea- 
surements. It also has the potential to be an even more im- 
portant issue for 3D measurements, where the size of fluc- 
tuations one aims to measure are signiflcantly smaller. For 



example, the dimensionless 3D power at fc = 0.1 Mpc~ is 
A| ~ 10~*, which would require that the mean flux evolu- 
tion be known to precision <C 10~^ in order to not bias the 
measurement of Pf averaged over a shell centered at this 
k. (Although, the power from the mean flux error would 
primarily impact purely line-of-sight modes.) Traditionally, 
(P) is estimated by averaging the transmission at redshift z 
from all the sightlines in a survey. Given N sightlines and 
if we want to estimate (P) in a region of size Ax, then this 
quantity can be estimated to precision 



2 



(Plos + Pn 



(37) 



Noting that the numerator on the right hand side is 0(1), it 
takes 1000 Lya forest spectra to constrain a^p-^ in a 10 Mpc 
region to reach the threshold of 10~^, which is much less 
than the total number of Lya forest spectra in BOSS or 
BigBOSS. 

However, stacking to obtain (P) requires that no mis- 



© 2009 RAS, MNRAS 000, 1 



12 M. McQuinn and M. White 



takes are made. For example, a systematic error in the con- 
tinuum subtraction can lead to a biased estimate of (_F). 
Fortunately, in 3D analyses, one has the ability to discern 
these effects at a later step in the analysis. A mean flux er- 
ror will principally lead to spurious power that only affects 
purely line-of-sight modes. Thus, one can simply throw away 
the first X modes along the origin in the analysis to remove 
these errors. In addition, any bleeding to other modes owing 
to the complicated survey window may not be so worrisome 
because the error in the mean flux is at the percent-level such 
that the total variance contributed by this error is compara- 
ble to A|. . For example, the power in this bleeding for the Pp 
estimator described in Section[2]is suppressed by N~^^^ rel- 
ative to the mean flux-error power in the line-of-sight modes. 
Errors in the mean flux also enter in convolution with Pp, 
but this effect is less of a contaminant again given that the 
sizes of these errors are percent-level over x ~ 100 Mpc. For 
such an error, a percent of 5f is smoothed over Afcy ~ x~^. 

Another systematic is the damping wing absorption 
from Lyman-limit and damped Lyman-a systems. This ab- 
sorption is generally not included in simulations of the Lya 
forest, and it arises from dense, self-shielding systems that 
are not captu red properly in almost all cosmological simula- 
tions anyway (|Katz et al.ill996l : lMcQuinn et al.ll201ll '). These 
systems could alter the line-of-sight powe r at the 10% level 
at fell « 0.1 Mpc"^ (|McDonald et al.ll2005l '). and at the 100% 
level at fcy = 0.01 Mpc"^ (Appendix H]). 

Appendix [B] discusses a simple model for the power in 
these systems that qualitatively reproduc es the numerical 
estima tes for the effect of damping wings in lMcDonald et al] 
and that allows us to estimate their impact for 3D 
analyses. We show that much of the line-of-sight power from 
damping wings arises from the uncorrelated (shot) compo- 
nent of their power. The shot contribution to the 3D flux 
power from damping wings is much smaller than in the line- 
of-sight flux power spectrum, and, thus, the role of damping 
wings as a contaminant is reduced (Appendix |B]). However, 
we show that they still are likely to add power to Pp at the 
tens of percent level. 



6 CONCLUSIONS 

This paper studied issues relevant to upcoming 3D Lya for- 
est surveys. We derived a simple formula for how to opti- 
mally weight sightlines with varying S/N levels. We found 
that simply weighting sightline i by (1 + cr^,i/o"iis)~^, where 
I and o-f^g are respectively the variance of the noise and 
of the Lya forest normalized flux in a 10 Mpc region or 
greater, performs nearly as well as the weights specifled by 
the minimum variance quadratic estimator. These weights 
should be simple to apply to data even in the presence of 
real-world complications. We derived an expansion (which 
involves only matrix multiplications) that converges to the 
minimum variance quadratic estimator and for which our 
suggested weights are the lowest order contribution. We 
showed that the next term in this expansion has the intu- 
itive behavior of suppressing the contribution from quasars 
that have an overabundance within r_i_ <k~[^ . 

We showed that the sensitivity of a spectroscopic sur- 
vey to the Lyof forest flux power spectrum can be quanti- 
fled by just a single number, ficfi - a noise-weighted num- 



ber density of sources on the sky - so that var[PF(fc)] = 
2 {Pp-\-n~^ Pios)^. While this number technically depends on 
the fc|| of the wavevector in question, in practice this depen- 
dence is extremely weak at fcy < 0.5 Mpc~^ because Pios is 
almost constant over these wavevectors. These are the same 
wavevectors at which 3D Lya surveys have the potential to 
derive competitive constraints on cosmological parameters 
(McDonal d fc Eisen stcin 2007) and that are the most inter- 
esting for stud ying astrophysical sou rces of fluctuations in 
the Lya forest jMcQuinn et al.ll2010l '). 

We calculated neSf as a function of survey speciflcations 
for both quasar and galaxy surveys at different redshifts. For 
quasar surveys, it is difficult to achieve significantly higher 
Weff than ~ 3 X 10""^ Mpc~^ (or ~ 30 per deg^) at any red- 
shift owing to the shallowness of the faint end of the quasar 
luminosity function. A survey with ficff = 3 x 10"'^ Mpc~^ 
is aliasing noise-limited at > 0.1 Mpc~^. In this limit, the 
S/N on Pp scales linearly with n^s. 

This paper also discussed survey strategy tradeoffs. The 
previous results allowed for a simpler characterization of 
these tradeoffs than in prior studies. We showed that a 
survey's sensitivity to the flux correlation function is maxi- 
mized with the strategy of integrating on each field just long 
enough to achieve S/N « 2 in a 1 A pixel for an L, quasar. 
However, we found that a shallower strategy but covering a 
wider field formally obtains a comparable sensitivity to Pp 
at 0.1 < < 0.5 Mpc"^ (but with lower S/N per mode), 
and it could even be more sensitive at smaller fc. While such 
low S/N values may cause worry regarding the efficacy of 
continuum subtraction, we further argued that continuum 
removal errors as well as other systematics are likely to be 
less problematic in 3D correlation analyses compared to in 
ID. Lastly, we found that using the spectra from galaxies 
never results in the sensitivity being vastly improved over a 
shallower but wider survey that only incorporates quasars. 

We quantified as a function of ficff a survey's sensitivity 
to the angular diameter distance at the mean redshift 
of the survey, to the Hubble expansion rate, and to the 
sources of fluctuations that do not trace density. Upcoming 
surveys have the potential to measure Da(z) and H[z) to 
1 — 2%, which would substantially improve constraints on 
the curvature of space-time and early dark energy models. 
These surveys also have the potential to detect other 
contributions to the flux power at the percent-level and, 
thereby, constrain the level of temperature fluctuations 
relic from reionization processes and the properties of the 
extragalactic ionizing background. 

We thank Anze Slosar for helpful comments on this 
manuscript, and Shirley Ho, Adam Lidz, Nic Ross, Jennifer 
Yeh, and Matias Zaldarriaga for useful discussions. MM is 
supported by the NASA Einstein Fellowship. 
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APPENDIX A: POWER-LAW LUMINOSITY 
FUNCTIONS 

We gain some insight into fioff by imagining that the distri- 
bution of observed fluxes for quasars in the target redshift 
range is a power law. 



No f f 



(a> 1), 



fo V/o 

and that we observe all quasars above a given flux limit /, 



(Al) 



No ff^y'\ (A2) 



/mi„ ./o V /() / a - 1 V fo 

If observing conditions are such that the noise in the forest 
for a quasar at flux fo has Pjv,n ~ o"o -Pios and scales as 
then 



rieff = no 



(///o) 



l + a2(/o//)2 



fo' 



(A3) 



where no = No/ A. The behavior of the integral is controlled 
by "I> = cro/o//min; for general a it can be written in terms of 
a hypergeometric function, however for integer a it reduces 
to elementary functions. For example, when a = 2 



ncff = no 



tan"^ $ 



while for a = 3 



ncff = no 



ln(l+_$^ 



(A4) 



(A5) 



and larger a give more negative powers of ao and combina- 
tions of logs or arctangents. 

A natural choice for fo is to make cro = 1, so that no is 
the number of quasars with Pjv = Pios and <E> measures the 
minimum flux in units of the characteristic flux. It is easy 
to see that decreasing /min, i.e. increasing $, leads to larger 
fioff but that the gains are small once $ ^ 1. For /min ~ fo 
n-cff is no up to a numerical constant of order unity, which 
reinforces the discussion in the text. 



APPENDIX B: DAMPED ABSORBERS 

As briefly discussed in Section [5] an important systematic 
for Lya forest analyses is the damping wing absorption from 
Lyman-limit and damped Lyman-a systems. A simple halo- 
like model for the contribution to the 3D power of the ab- 
sorption from these systems (applicable at scales larger than 
those affected by thermal broadening) is 



oDLA 



W^2(fc||)'(6DLA + /i')'P""(fe) 



(Bl) 



^•10=^* = m(fc||) + lV2(fc||)' |^fcx(6DLA + M')'j'r(fc), 

where we have neglected the shot noise component in Pp^^ 
because it is small on relevant scales (having amplitude ~ 
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Figure Bl. Ratio of different components of the power in damp- 
ing wings to the Lya forest power without damping wing absorp- 
tion as a function of fcy and at z = 2.5. The top panel is for 
the line-of-sight power spectrum and the bottom is for the 3D 
power spectrum. The thick solid, thick short-dashed, and thick 
long-dashed curves are respectively the 1-DLA term, the 2-DLA 
term, and the Ly« forest— DLA cross power. The corresponding 
thin curves are the same but are calculated only using systems 
with A^Hi > 10^'' cm- 2. 

1 Mpc~^, which is much less than b^L^. ^i'") ^-nd 

W,m) = I dNm ^^^^d{k^^ f, (B2) 

W^.(fc||) = /diVH:^^d>,l). (B3) 

The function d is the Fourier transform of the damping 
wing absorption profile, 1 — exp[— ft;^A^Hi/(x ~ Xo)^], X is 
the conformal distance, and xo is the value of x at line- 
center. We will refer to the first and second terms in the 
equation for Pi^l'^ as the 1-DLA and 2-DLA terms, respec- 
tively. For details, this is analogous to how the dark matter 
po wer spectrum in the hal o model is calculated, as reviewed 
in lCoorav fc ShethI (|2002l ). In addition, 6dla is the bias of 
such systems (taken to be 2 here), d^JV/dxdNm is the num- 
ber of systems per x per A'hi, and — 5x 10"^' cm^'^ Mpc^ 
at 2 = 3. At small fcy, W2 becomes equal to the fraction of 
the continuum that is absorbed by damping wings. Finally, 
the total effect of damping wings and un-damped Lya forest 
absorption on the full 3D fiux power spectrum is 

Pf' ^\b + boLAmm )] Pf ^ + higher order. (B4) 

Figure IBll plots the fraction of power that originates 



from damped systems at z = 2.5. These calculations assume 
that f(N-ai,X) = d'^Af/dzdNm^^M /{I + z) integrates to 
0.1 between 10^^ and 10^°'^ cm ^ , is a power- law with index 
of -1.2 at iVm < 10^"'^ cm-^ and with index of -1 .8 at 
higher columns, as mo tivated in lO'Meara et al.l (|2007| ) and 
iProchaska et alj l|2010l 'l. The function /(A''hi, X) is uncertain 
at the factor of 2-level. 

The top panel in Figure iBll shows the ratio of the line- 
of-sight power in damping wings to Pios, and the bottom is 
for the same but for the 3D power. The black thick solid, 
red thick short-dashed, and blue thick long-dashed curves 
are respectively the 1-DLA, 2-DLA terms, and cross-power 
terms. The 1-DLA component is only shown in the top panel 
because it is a subdominant contribution to the 3D power 
at the plotted scales owing to the high 3D number den- 
sity of these systems. The 1-DLA term is a more important 
contribution than the 2-DLA to Pio'i, and the cross-power 
term with the forest, 2 h &dla VK2Pa'", is important at higher 
fc||. However, the cross term is the most important contri- 
bution to the 3D forest power (bottom panel). The thin 
dashed curves are the same as the thick curves but only 
include systems with A'hi > 10'^° cm"'^ (the contribution 
that is easiest to remove in pre-processing). Systems with 
A'hi > 10^° cm"^ contribute most of the 1-DLA power in 
^los"^ at the smallest fc, but contribute little of the 1-DLA 
power at fc > 0.1 Mpc"'^, of the 2-DLA power in Pi^l and in 
Pp", or of the cross power. 

The morphology and amplitude of the line-of-sight 
curves in the top panel of Figure IBll are similar to what 
was fo und in the numerical calculations of iMcDonald et al] 
(200 51), who considered fc || > 0.1 Mpc"'^ (see their Fig. 
2). ;McDonald et al.ll2005l found a plateau at the highest fc 
shown in Fig. IBll that likely owes to the higher order terms, 
such as Piog""^ ★ Pioa ■] Our model ignores the coincidence be- 
tween the normal Lya absorption and the damping wing ab- 
sorption: Damped regions occur where ther e is already more 
absorption by un-damped Lya absorption. iMcDonald et al.) 
(|2005h found that this correlation suppresses the impact of 
damping wing absorption by a factor that can be as large 
as 2. 

Our toy model for the impact of damping wings provides 
a couple insights. First, it shows that there is little point of 
removing the contribution from the highest column systems 
in 3D analyses of the forest because they contribute a small 
component of the total power from damping wings. Second, 
because the 1-DLA term is unimportant in 3D, this reduces 
the amplitude of the contribution from damped systems rel- 
ative to ID analyses, especially at the smallest wavevectors. 
However, we predict that the contribution from damping 
wings to Pf is still non-negligible. Lastly, in a 3D survey 
one has the ability to fit for the the damping wing contri- 
bution: It should scale at relevant scales as C(fc||) P]'"(fc), 
where C is some function that only depends on fey. 
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